Bounds on the Number of Longest Common Subsequences

نویسنده

  • Ronald I. Greenberg
چکیده

This paper compares the standard, naive method of computing all distinct longest common subsequences, or of computing all LCS embeddings, to more efficient methods with running time comparable to the output size. Included in this analysis is a determination of the maximum possible number of LCS embeddings in two input sequences, as a function of the total number of input characters. Also included are bounds on the maximum possible number of distinct LCSs. In addition, much of the analysis carries over to cases in which the two input sequences are of the same specified length or of two independently specified lengths.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Longest common subsequences and the Bernoulli matching model: numerical work and analyses of the r-reach simplification

The expected length of longest common subsequences is a problem that has been in the literature for at least twenty five years. Determining the limiting constants γk appears to be quite difficult, and the current best bounds leave much room for improvement. Boutet de Monvel explores an independent version of the problem he calls the Bernoulli Matching model. He explores this problem and its rel...

متن کامل

Computing the Number of Longest Common Subsequences

This note provides very simple, efficient algorithms for computing the number of distinct longest common subsequences of two input strings and for computing the number of LCS embeddings.

متن کامل

On the Constrained Longest Common Subsequence Problem

The problem of the longest common subsequence is a classical distance measure for strings. There have been several attempts to accommodate longest common subsequences along with some other distance measures. There are a large number of different variants of the problem. In this paper, we consider the constrained longest common subsequence problem for two strings and arbitrary number of constrai...

متن کامل

On the Convergence of Upper Bound Techniques for the Average Length of Longest Common Subsequences

It has long been known [2] that the average length of the longest common subsequence of two random strings of length n over an alphabet of size k is asymptotic to γkn for some constant γk depending on k. The value of these constants remains unknown, and a number of papers have proved upper and lower bounds on them. In particular, in [6] we used a modification of methods of [3, 4] for determinin...

متن کامل

All Common Subsequences

Time series data abounds in real world problems. Measuring the similarity of time series is a key to solving these problems. One state of the art measure is the longest common subsequence. This measure advocates using the length of the longest common subsequence as an indication of similarity between sequences, but ignores information contained in the second, third, ..., longest subsequences. I...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره cs.DM/0301030  شماره 

صفحات  -

تاریخ انتشار 2003